Evaluation of Classifiers to Enhance Model Selection
Abstract
Data mining performs tasks such as classification, clustering, and association rule derivation to extract patterns. Performance evaluation measures make each task distinct and meaningful. The many available machine learning algorithms help in different ways. Classification helps predict future outcomes well in advance so that the necessary actions can be taken, which is why it is also called actionable data mining. In this paper we give an overview of various classification algorithms provided by the Waikato Environment for Knowledge Analysis (WEKA). The measures WEKA reports help determine the best model, and we propose a statistical analysis, namely the paired t-test, to enhance model selection. Together, these evaluations provide a promising environment for model selection.
Keywords: Evaluation; Accuracy; T-Test; Data Mining; Classification; WEKA; Stratified Cross Validation; ROC
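The paired t-test mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the paper's or WEKA's implementation: it assumes two classifiers were scored on the same cross-validation folds, and the function name `paired_t_statistic` and the per-fold accuracies are hypothetical, invented only to show the arithmetic.

```python
import math
import statistics


def paired_t_statistic(scores_a, scores_b):
    """Return (t, degrees_of_freedom) for a paired t-test on matched scores."""
    if len(scores_a) != len(scores_b):
        raise ValueError("both classifiers need one score per fold")
    n = len(scores_a)
    if n < 2:
        raise ValueError("at least two folds are required")
    # Per-fold differences between the two classifiers
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    if sd_diff == 0:
        raise ValueError("differences have zero variance; t is undefined")
    # t = mean difference divided by its standard error
    t = mean_diff / (sd_diff / math.sqrt(n))
    return t, n - 1


# Hypothetical per-fold accuracies (illustration only, not results from the paper)
acc_a = [0.90, 0.85, 0.88, 0.92, 0.87]
acc_b = [0.86, 0.84, 0.85, 0.88, 0.86]
t, df = paired_t_statistic(acc_a, acc_b)
print(f"t = {t:.3f}, df = {df}")
```

To decide whether the difference is significant, compare |t| with the critical value of the t-distribution for the given degrees of freedom (2.776 for 4 degrees of freedom at the two-sided 5% level). Scores from overlapping cross-validation training sets are not fully independent, so a plain paired t-test of this kind can overstate significance.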
Similar resources
Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets
Objective(s): This study addresses feature selection for breast cancer diagnosis. The present process uses a wrapper approach based on GA feature selection and a PS-classifier. The experimental results show that the proposed model is comparable to other models on the Wisconsin breast cancer datasets. Materials and Methods: To evaluate effectiveness of proposed feature selection method, we ...
Evaluation of Classifiers in Software Fault-Proneness Prediction
The reliability of software depends on its fault-prone modules: the fewer fault-prone units the software contains, the more we may trust it. Therefore, if we are able to predict the number of fault-prone modules in a piece of software, it becomes possible to judge its reliability. In predicting software fault-prone modules, one of the contributing features is software metric by which one ...
Application of ensemble learning techniques to model the atmospheric concentration of SO2
In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...
Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection
With the rise of technology, the possibility of fraud in areas such as banking has increased. Credit card fraud is a crucial problem in banking, and its danger is ever increasing. This paper proposes an advanced data mining method that considers both feature selection and decision cost to enhance the accuracy of credit card fraud detection. After selecting the best and most effec...
A Two-phase Model for Product Design Development and Evaluation and Supplier Selection in Product Configuration Change Process (Case study: SUPCO)
A supply chain is composed of a complex sequence of processing stages, ranging from raw materials supplies, parts manufacturing, components and end-products assembling, to the delivery of end products. In the context of supply chain management, supplier selection decision is considered one of the key issues faced by operations and purchasing managers to remain competitive. Therefore, requiremen...
Publication date: 2013